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Abstract. The Vector Spectromagnctograph (VSM) of the NSO's Synoptic 
Optical Long-Term Investigations of the Sun (SOLIS) facility is now operational 
and obtains the first-ever vector magnetic field measurements of the entire vis- 
ible solar hemisphere. To fully exploit the unprecedented SOLIS/VSM data, 
however, one must first address two critical problems: first, the study of solar 
active regions requires an automatic, physically intuitive, technique for active- 
region identification in the solar disk. Second, use of active-region vector mag- 
netograms requires removal of the azimuthal 180°-ambiguity in the orientation 
of the transverse magnetic field component. Here we report on an effort to 
address both problems simultaneously and efficiently. To identify solar active 
regions we apply an algorithm designed to locate complex, flux-balanced, mag- 
netic structures with a dominant E-W orientation on the disk. Each of the disk 
portions corresponding to active regions is thereafter extracted and subjected to 
the Nonpotential Magnetic Field Calculation (NPFC) method that provides a 
physically-intuitive solution of the 180°-ambiguity. Both algorithms have been 
integrated into the VSM data pipeline and operate in real time, without human 
intervention. We conclude that this combined approach can contribute meaning- 
fully to our emerging capability for full-disk vector magnetography as pioneered 
by SOLIS today and will be carried out by ground-based and space-borne mag- 
netographs in the future. 



1. Introduction 

SOLIS is a state-of-the art ground-based facility dedicated to the study of the 
Sun and its magnetic atmosphere for decades to come. It consists of three instru- 
ments - the vector spectromagnetograph (VSM), the Integrated Sunlight Spec- 
trometer (ISS), and the Full-Disk Patrol (FDP). An overview on each instrument 
can be found in Keller, Harvey, & Giampapa (2003). The VSM, in particular, 
operates from Kitt Peak since May 2004. Among other daily measurements, 
the VSM performs complete Stokes polarimetry at the Fe I 630.2 nm photo- 
spheric spectral line. An inversion of the Stokes images provides full-disk vector 
magnetograms of the solar photosphere (see Henney, Keller, & Harvey, 2006, 
for details). The VSM data are the first spectrographic full-disk vector mag- 
netograms ever obtained. Partial-disk vector magnetography is performed by a 
handful of ground-based instruments and, recently, by the spectro-polarimeter of 
the Solar Optical Telescope (SOT) on board the Hinode satellite (Lites, Elmore, 
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& Streander 2001). Hinode carries the first space-based vector magnetographs, 
while the first air-borne vector magnetograph was included in the Flare Gen- 
esis Experiment balloon payload (Rust, 1994). The first space-based full-disk 
vector magnetograms will be acquired by the Helioseismic and Magnetic Im- 
ager (HMI; Scherrer 2002) on board the imminent Solar Dynamics Observatory 
(SDO). From the above series of groundbreaking developments, one feels that 
solar vector magnetography is probably entering its golden era. Therefore, it 
is both essential and timely to address and efficiently solve the core problems 
associated with it in order to fully exploit the unprecedented existing and future 
vector magnetogram data. 

Vector magnetic field measurements inferred by the Zeeman effect suffer 
from an intrinsic azimuthal ambiguity in the orientation of the transverse mag- 
netic field component (Harvey, 1969). Indeed, the properties of the transverse 
Zeeman effect remain invariant under the transformation <j) — > <j) + ir, where (j) is 
the azimuth angle. The problem of the 180°-ambiguity has proved notoriously 
difficult to solve self-consistently, despite numerous attempts (for an overview, 
see Metcalf et al., 2006, and references therein). For full-disk magnetograms, an 
additional problem appears in case active regions should be studied. How are 
active regions to be extracted from the full-disk measurements? This task may 
appear trivial if performed manually, but it is quite daunting in case voluminous 
data are to be processed. Moreover, applications intimately related to active 
regions and their magnetic evolution such as, for example, magnetic helicity cal- 
culations or major flare forecasting, require a robust definition of active regions 
and their spatial extent. 

In the following, we discuss our proposed solutions to both active region 
identification and azimuth disambiguation. Both techniques have been inte- 
grated into the VSM data pipeline and are operating in real time owing to the 
modest computational resources they require. Their operation is smooth and 
their solutions are robust, and this holds promise for automatic application to 
future full- or partial-disk vector magnetogram data. 

2. VSM Data Processing 

2.1. Automatic Identification of Solar Active Regions 

Our active-region identification method does not require vector magnetograms. 
It was conceived by B. J. LaBonte and was first applied by LaBonte, Georgoulis, 
& Rust (2007) to full-disk line-of-sight magnetograms of the Michelson-Doppler 
Imager (MDI - Scherrer et al., 1995) on board the Solar and Heliospheric Ob- 
servatory (SoHO). The technique consists of five steps, namely: 

1 . Smooth the full-disk line-of-sight magnetic field Bp using a smoothing win- 
dow with linear size equal to one supergranular diameter (SGD), that is, 
~ 40", or ~ 30 Mm on the solar surface. In the smoothed image, (i) 
test for bipolarity, enhancing magnetic polarity inversion lines, and (ii) 
calculate the gradient between the two polarities, emphasizing their E-W 
orientation. From this information (smoothed image, bipolarity, orienta- 
tion) create an intensity image with enhancements corresponding to the 
active regions present on the disk (Figure la). 
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Figure 1. Automatic active- region identification in a full-disk VSM mag- 
netogram obtained on 2005 July 7, at around 15:14 UT. (a) Negative of the 
intensity image discussed in step 1. The centroid location for each qualify- 
ing intensity enhancement (see steps 2 and 3) is indicated by a white dot. 
Notice that some minor enhancements do not qualify for active-region asso- 
ciation, (b) The final result of the identification process. Each active region 
is enclosed by a square prescribed to an outer circle from each centroid, as 
described in step (e). The corresponding NOAA numbers are also provided. 



2. Coalign the intensity image with the actual magnetogram. Identify mag- 
netic flux concentrations that coincide with the image's enhancements and 
discard those with flux imbalance larger than a prescribed threshold. 

3. For each qualifying intensity enhancement, determine an intensity-weighted 
centroid (barycenter) . 

4. Match the location of each centroid with NOAA's Space Environment 
Center (SEC) archives to assign AR numbers to each centroid. For multiple 
numbers, choose the one with assigned location closest to the centroid. 

5. Starting from each centroid, determine the spatial extent (area) of the 
respective active region. 

Smoothing by 1 SGD provides a first clue of the existing active regions and their 
locations on the disk. The size of the smoothing window reflects the fact that su- 
pergranules are the fundamental convection cells responsible for the large-scale 
magnetic fields in the Sun and, therefore, for active-region formation (see, e.g., 
Leighton, Noyes, & Simon, 1962). The flux imbalance criterion points to the 
fact that active regions are mainly closed magnetic field structures. Nonetheless, 
the imbalance tolerance limit is an external variable and can be set at will. The 
dominant E-W orientation criterion helps avoid identifying accidental flux asso- 
ciations as active regions. The centroid calculation helps associate the identified 
flux concentrations with the standard NOAA active-region number database. 

The remaining task is the determination of the spatial extent of each active 
region. This is accomplished in five steps, namely: 
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(a) Draw two concentric circles around each centroid - an inner one with radius 
equal to 1 SGD and an outer one with radius equal to 6 SGD. 

(b) Define circles with increasing radii from 1 to 6 SGDs and find the average 
value of \B(\ on each perimeter. 

(c) Find the median of the average |I?£|-values between 5 and 6 SGDs from 
the centroid. Use this median as a 1 — a threshold. 

(d) Check the perimeter averages of \B(\ and stop at the circle where the 
average \B(\ falls below 2a, i.e., becomes smaller than twice the above 
median. 

(e) Add 1 SGD to the radius of this circle and draw the square prescribed 
to this radius. This square will outline the edges of the identified active 
region (Figure lb). 

We underline that our size calculation technique, namely, defining an annulus 
with internal and external radius of 5 and 6 SGDs, respectively, to define the 
1 — a threshold, precludes strong magnetic fields from being included in areas 
belonging to multiple active regions. Indeed, notice the overlapping between 
the areas of NOAA active regions (ARs) 10786 and 10788 in Figure lb. This 
common area does not include strong magnetic fields. This stems from the 
fact that if the annulus cuts through strong magnetic flux accumulations, then 
the 1 — a threshold will be higher and the average \Bg\ value will drop below 
2a at shorter distances from the centroid, thereby imposing a smaller spatial 
extent for the examined active region. We have run numerous tests with full- 
disk magnetograms to verify that the technique performs as expected in the vast 
majority of cases. 

2.2. Azimuth disambiguation 

Our azimuth disambiguation method is the Nonpotential Magnetic Field Cal- 
culation (NPFC). The technique was introduced by Georgoulis (2005) and was 
slightly refined (NPFC2) as discussed in Metcalf et al. (2006). In this work, it 
was tested against several other techniques and was found to reproduce success- 
fully the correct azimuth solution in highly nonpotential, noise-free, synthetic 
vector magnetograms. Moreover, it was among the fastest azimuth disambigua- 
tion techniques. In further comparisons, facilitated by a series of Azimuth Reso- 
lution Workshops, the NPFC2 method successfully reproduced the correct solu- 
tion in nonpotential synthetic vector magnetograms where various levels of noise 
were embedded. Overall, the NPFC2 (hereafter NPFC) method demonstrated 
its speed and efficiency even in cases with extreme noise levels, that eventually 
led to its integration to the VSM software package. 

The physics behind the NPFC method is simple and starts by noticing 
that any magnetic field configuration B can be decomposed into a current-free 
(potential) component B p and a current-carrying component B c , i.e. 

B = B P + B C . (1) 

If the magnetic structure is rooted in a lower boundary plane S and extends 
in the half-space above it, then B p and B c can be calculated on S provided 
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Figure 2. A graphical description of the NPFC algorithm. 



that the vertical (normal to S) component B z of the magnetic field B and the 
vertical component J z of the electric current density J (where J is calculated 
by Ampere's law), respectively, are known on S. For B p there are multiple 
calculation methods including Green's functions (Schmidt, 1964) and Fourier 
transforms (Alissandrakis, 1981). For B c , one utilizes the gauge conditions 
applying to S, and especially the fact that B c has only an azimuthal component 
on S, i.e., B Cz \s = 0. Assuming that B Cz does not vary significantly with height 
on S, i.e., (dB Cz /dz)\s = 0, B c can be calculated in Fourier space and then 
inverted into Cartesian space, i.e., 



B r 



Ah 



ik x 



A: 2 



(2) 



In Equation (2) we have k = k x + ky for the harmonic (k x ,k y ) and j z = 
(4tt/c)J z , while !F(r), F~ l {r) are the direct and inverse Fourier transforms of r, 
respectively, at (k x , k y ). Notice that the only assumption of the NPFC method 
is {dB Cz /dz)\s = 0. This assumption is quite reasonable because B Cz \s = 
and, for small length elements, one might expect that B Cz ~ slightly above S, 
unless the magnetic field lines undergo dramatic changes of orientation within 
the elementary height. Assuming that S is the plane of the magnetic field 
measurements, an unambiguous magnetic field B can be reconstructed on this 
plane from B p and B c . The problem, of course, is that both B z and J z , required 
to calculate B p and B c , respectively, are subject to the 180°-ambiguity and are 
not known a priori. This is where the numerical part of the NPFC method 
begins. 

Figure 2 provides a graphical representation of the numerical process. Initial- 
guess distributions of B z and J z are used to calculate the initial-guess B p and 
B c , respectively. The total field B is then reconstructed through Equation (1). 
The reconstructed field is compared to the two equally possible ambiguity solu- 
tions and is set equal to the solution that is closer to it, for each location of the 
measurements' plane. This interim B-configuration gives rise to new B z - and 
./^-solutions. From them, new B p - and B c -configurations are produced, and the 
algorithm proceeds to a new iteration. Convergence is judged by the number 
of strong-field vectors whose orientation changes from one iteration to the next. 
When no vectors (or a small number of vectors) are flipped for 10 consecutive 
iterations, the process stops and the latest reconstructed B-configuration is the 
suggested disambiguation solution. 





Figure 3. Example of the NPFC disambiguation solution. Shown are the 
hcliographic magnetic field components on the image plane, (a) The solution 
on part of a Hinode/SOT spectro-polarimeter vector magnetogram depicting 
NOAA AR 10930, obtained on 2006 December 11 between 13:53 UT and 15:15 
UT. The data features a sunspot complex separated by a strongly sheared 
magnetic polarity inversion line, enclosed by the white square. Tick mark 
separation is 20". (b) The disambiguation solution for the enclosed area. Tick 
mark separation is 5". In both images, the vertical magnetic field saturates 
at ±1000 G. Data courtesy of B. W. Lites. 



The entire calculation requires minimal computing resources and is com- 
pleted in a matter of minutes, even at an ordinary desktop workstation. Re- 
markably, the speed of the convergence does not depend on the complexity of the 
studied magnetogram but, rather, on its quality and linear size. In other words, 
the quality of the solution depends strongly on the quality of the measurements. 
To demonstrate this, we show in Figure 3 a disambiguated vector magnetogram 
from Hinode's SOT spectro-polarimeter that was inverted and kindly provided 
to us by B. W. Lites. These data are of exceptionally high seeing-free quality and 
nearly unsurpassed spatial resolution (~ 0.16" per pixel). The depicted active 
region (NOAA AR 10930) shows a significant degree of complexity, especially 
along a strongly sheared magnetic polarity inversion line. The magnetogram's 
field of view had a very large linear size, namely, 1904 x 1024 pixels. Yet the 
NPFC method required only ~ 20 rain and 34 iterations to converge. The re- 
spective computing time for typical linear dimensions of, say, 512 x 512 pixels, 
would not exceed 2 min. 

Figure 4 provides the NPFC disambiguation solutions for the active regions 
identified in the VSM magnetogram of Figure 1. Inspecting Figure 4, one might 
notice some inconsistencies in the orientation of the horizontal magnetic field. 
These imperfections should be mostly attributed to problems in the inference 
of the VSM azimuth angle. Indeed, the VSM measurements at the time of the 
observation (07/07/05) were still quite preliminary, with now-known issues not 
addressed in the data. The reason why we chose to show these data, nevertheless, 
is the high degree of activity in the solar atmosphere at the time of the observa- 
tion. Vector magnetograms obtained thereafter showed much less activity and 
fewer, simpler, active regions, although the inference of the azimuth angle was 
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Figure 4. The NPFC disambiguation solutions for the four active regions 
identified in the VSM magnctogram of Figure 1 . Shown are the heliographic 
magnetic field components on the image plane. The solutions are given for 
a part of the area assigned to each region, namely, the part containing the 
majority of the active-region magnetic flux. Tick mark separation in all images 
is 20". The vertical magnetic field in all images saturates at ±1000 G. 

drastically improved. In the near future, when solar activity will start stepping 
up toward the next maximum, the VSM will be fully equipped with sufficient 
hardware and software to reliably acquire and process massive amounts of data. 



3. Conclusions / Current status of the VSM vector magnetogram 
data 

Although VSM full-disk vector magnetograms at the Fe I 630.2 nm photospheric 
line are obtained since August 2003, these data have not yet been released to the 
solar and space physics communities. This is due to work currently underway 
on the last remaining issues that, however, must be addressed prior to releasing 
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any digital data. This being said, we are confident that the data will be available 
for unrestricted use very soon. 

Complete descriptions of VSM and its data archive can be found at 
http : / / solis . nso . edu/ solis_data . html . Among other information, this web 
page includes quick-look visualizations of both the latest VSM vector magne- 
tograms and the regularly updated list of the VSM vector magnetogram data 
archive. Featuring a user-friendly interface, one may view all the magnetic field 
components for each identified active region, together with the zenith and az- 
imuth angles of the disambiguated magnetic field vector. Upon the release of 
the VSM data, full Milne-Eddington inversion products will be available nearly 
24 hours after the quick-look data acquisition (Henney, Keller, & Harvey, 2006). 

As for the active-region identification and azimuth disambiguation tech- 
niques, this article demonstrates that they are general enough to be applied to 
both existing and future vector magnetograms, either in conjunction or indepen- 
dently. Partial-disk magnetograms (e.g., from Hinode) might be disambiguated 
using the NPFC method, while future SDO/HMI full-disk magnetograms might 
undergo a process similar to that described in §2 for the SOLIS/VSM data. 
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